Intelligent Information Access from Scientific Papers
نویسندگان
چکیده
We describe a novel search engine for scientific literature. The system allows for sentence-level search starting from portable document format (PDF) files, and integrates text and image search, thus facilitating the retrieval of information present in tables and figures. It allows the user to generate in an intuitive manner complex queries for search terms that are related through particular grammatical (and thus implicitly semantic) relations. The system uses grid processing to parallelise the analysis of large numbers of scientific papers. It is currently undergoing user evaluation, but we report some preliminary evaluation and comparison with Google Scholar, demonstrating its utility. Finally, we discuss future work and the potential and complimentarity of the system for patent search.
منابع مشابه
Natural Language Processing for Intelligent Access to Scientific Information
During the last decade the amount of scientific information available on-line increased at an unprecedented rate. As a consequence, nowadays researchers are overwhelmed by an enormous and continuously growing number of articles to consider when they perform research activities like the exploration of advances in specific topics, peer reviewing, writing and evaluation of proposals. Natural Langu...
متن کاملGuest Editorial: Impact of Integrated Intelligent Information and Analytical Systems on Society
The Special Issue of the Journal of Information Technology Management (JITM) is publishing very selective papers on information management, technology in higher education, integrated systems, enterprise management, cultural thoughts, strategic contributions, management information systems, and cloud computing. We received numerous papers for this special issue but after an extensive pe...
متن کاملCamtology: Intelligent Information Access for Science
We describe a novel semantic search engine for scientific literature. The Camtology system allows for sentence-level searches of PDF files and combines text and image searches, thus facilitating the retrieval of information present in tables and figures. It allows the user to generate complex queries for search terms that are related through particular grammatical/semantic relations in an intui...
متن کاملSemantic-Based Access to Digital Document Databases
Discovering significant meta-information from document collections is a critical factor for knowledge distribution and preservation. This paper presents a system that implements intelligent document processing techniques, by combining strategies for the layout analysis of electronic documents with incremental first-order learning in order to automatically classify the documents and their layout...
متن کاملچهار دهه فعالیت علمی ایران از منظر مقالات همایشها، مقالات پر استناد و داغ و مقالات دسترسی آزاد با نگاهی به قانون برنامه توسعه اقتصادی ، اجتماعی، فرهنگی کشور
This study aims to investigate Iran scientific production Pre-revolutionary by 2016 with the emphasis on the conferences proceedings, highly cited and hot papers, and open access papers, in the light of the Law of Economic, Social, and Cultural Development Plan of Iran. Descriptive – analytical method used. To achieve research objectives data extracted from Clarivate Analytics (Thomson Reuters)...
متن کامل